STIC Biotechnology Systems Branch 



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.2.2 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS : 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indi cated addresses: 

1 EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 

User Manual - ePAVE) 
2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 223 13-1450 
3 Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 

U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 

Alexandria, VA 22314 

Revised 01/24/05 
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RAW SEQUENCE LISTING DATE: 02/06/2006 

PATENT APPLICATION: US/10/532 , 198 TIME: 14:26:38 

Input Set : A:\ 211010049SEQLIST.TXT 
Output Set: N:\CRF4\02012006\J532198.raw 



4 <110> APPLICANT: Sanchez, Alejandro 

5 Robb, Sofia 

7 <12 0> TITLE OF INVENTION: MANAGING BIOLOGICAL DATABASES 
10 <130> FILE REFERENCE: 21101. 0049U2 

12 <140> CURRENT APPLICATION NUMBER: 10/532,198 

13 <141> CURRENT FILING DATE: 2005-04-21 

15 <150> PRIOR APPLICATION NUMBER: PCT/US2 003/03 3590 

16 <151> PRIOR FILING DATE: 2003-10-22 

18 <150> PRIOR APPLICATION NUMBER: 60/420,216 

19 <151> PRIOR FILING DATE: 2002-10-22 



21 <160> NUMBER OF SEQ ID NOS : 9 D©©§ CofHp|y 



23 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 3 60 y H 

27 <212> TYPE: DNA ^/Mf^ ' *" ^ 

28 <213> ORGANISM: Artificial Sequence (/ " 

30 <220> FEATURE: 

31 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note = 

32 synthetic construct 

34 <220> FEATURE: 

35 <221> NAME /KEY : misc_jf eature 

36 <222> LOCATION: 169 

37 <223> OTHER INFORMATION: n=a, t, c, org 

39 <220> FEATURE: 

40 <221> NAME/KEY: misc_f eature 

41 <222> LOCATION: 276 

42 <223> OTHER INFORMATION: n=a, t, c, or g 

44 <400> SEQUENCE: 1 

45 cttctgcaag gtccacagtt accattaaca aatcgagtcc tgcagtaaag gaatatgaaa 60 

46 tgagacaatc ctacaatttt tctggagcac ctatgggggg atcagttcaa attcatagca 12 0 , 

47 atgtgtcttc tgctgtagag ggccgagaaa gagaaaagag agaaa tgcna gatcttaatg 180 ^) -^JIj^ 

48 aaaggctagc taattatatt gaaaaggtaa gatttctaga agctc^aac aaaagattaa 240 / ^ , 

49 caaatgaatt gaatacgtta cgtgaaagat ggggtnaaga agctgaaagg atacgagctt 3 00 /njUU*-*- 

50 tatatgagat tgaaatggat caattgaaaa agttattaga cgaagctgaa gctgctagat 3 60 SL/^JL&^t-edL**^ 

52 <210> SEQ ID NO: 2 ^>^^>^^ 

53 <211> LENGTH: 60 

54 <212> TYPE: PRT 

55 <213> ORGANISM: Artificial Sequence 

57 <220> FEATURE: 

58 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note = 

59 synthetic construct 

61 <220> FEATURE: 

62 <221> NAME/KEY: VARIANT 
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RAW SEQUENCE LISTING DATE: 02/06/2006 

PATENT APPLICATION: US/10/532,198 TIME: 14:26:38 

Input Set : A:\ 211010049SEQLIST.TXT 
Output Set: N:\CRF4\02012006\J532198.raw 

63 <222> LOCATION: 2^ 48 

64 <223> OTHER INFORMATION: Xaa = any amino acid 

66 <400> SEQUENCE: 2 

67 Asn Xaa Ser Ser Pro Ala Val Lys Glu Tyr Glu Met Arg Gin Ser Tyr 

68 1 5 10 15 

69 Asn Phe Ser Gly Ala Pro Met Gly Gly Ser Val Gin lie His Ser Asn 

70 20 25 30 

71 Val Ser Ser Ala Val Glu Gly Arg Glu Arg Glu Lys Arg Glu Met Xaa 

72 35 40 45 

73 Asp Leu Asn Glu Arg Leu Ala Asn Tyr lie Glu Lys 

74 50 55 60 

76 <210> SEQ ID NO: 3 

77 <211> LENGTH: 60 

78 <212> TYPE: PRT 

79 <213> ORGANISM: Artificial Sequence 

81 <220> FEATURE: 

82 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note = 

83 synthetic construct J\ JL A ' rjb*o* 

85 <220> FEATURE: ^ 00 ^S fti K £M S^TL&^" 

86 <221> NAME /KEY: \^RI^IiT 

87 <222> LOCATION: ntjJoT 13, 14, 17, 18, 20, 21, 42^, 46", 47-49, 52) 54, 59 

88 <223> OTHER INFORMATION: Xaa = any amino acid 

90^U0> SEQUENCE: 3 III I V \ 

9l( v AsnjXaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Met Xaa Xaa Ser Tyr 

92S^ 5 ^ ^ 10^ 15 



93 Xaa Xaa Ser Xaa Xaa Pro( Xaa Xaa X aa) Ser (Xa a Xaa Xa a; His SeiQCaa^y fcjj Jjl/Xjf^^ 1 ^ 

iaa^Gly Arg Glu Xaa Glu Lys Xaa Glu Xaa Xaa ^ ^ "2- ^ 



94 ^ 21T 30 

95 Val Xaa Xaa Xaa Xaa Xaa 

96 V 



97 Xaa Leu Asn Xaa Arg Xaa Ala Asn Tyr lie Xaa Lys /n./ /is* 

inn ^9ins cpo m wn- a -V / r ' 



100 <210> SEQ ID NO: 4 ■ / 

101 <211> LENGTH: 60 

102 <212> TYPE: PRT 

103 <213> ORGANISM: Artificial Sequence 

105 <220> FEATURE: 

106 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note 

107 synthetic construct 

109 <400> SEQUENCE: 4 

110 Asn Gin Asn Ala Ser Ser lie Arg Thr lie Glu Met Lys Lys Ser Tyr 

111 15 10 15 

112 Gly Val Ser Ala Thr Pro Gly Ala Thr Ser Asn He Val His Ser Gly 

113 20 25 30 

114 Val Asn Asn Leu Met Asn Gly Arg Glu Lys Glu Lys Asn Glu Leu Gin 

115 35 40 45 

116 Glu Leu Asn Asp Arg Phe Ala Asn Tyr He Asp Lys 

117 50 55 60 

119 <210> SEQ ID NO: 5 

120 <211> LENGTH: 60 



^7 -Y£ S2, 
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RAW SEQUENCE LISTING DATE: 02/06/2006 

PATENT APPLICATION: US/10/532 , 198 TIME: 14:26:39 

Input Set : A:\ 211010049SEQLIST.TXT 
Output Set: N:\CRF4\02012006\J532198.raw 

121 <212> TYPE: PRT 

122 <213> ORGANISM: Artificial Sequence 

124 <220> FEATURE: 

125 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note = 

126 synthetic construct 

128 <220> FEATURE: 

129 <221> NAME /KEY : VARIANT ' 

130 <222> LOCATION: 7 

131 <223> OTHER INFORMATION: Xaa = any amino acid 
133 <400> SEQUENCE: 5 

-> 134 Val Arg Phe Leu Glu Ala Xaa Asn Lys Arg Leu Thr Asn Glu Leu Asn 



2^7 



135 l 5 /*\ 10 15 

136 Thr Leu Arg Glu Arg Trp Gl Vxaa) Glu Ala Glu Arg lie Arg Ala Leu 

137 20 25 30 

138 Tyr Glu lie Glu Met Asp Gin Leu Lys Lys Leu Leu Asp Glu Ala Phe 

139 35 40 45 

140 Ala Ala Arg Ser Glu Leu Leu Pro Lys lie Asn Lys 

141 50 55 60 

143 <210> SEQ ID NO: 6 

144 <211> LENGTH: 60 

145 <212> TYPE: PRT 

146 <213> ORGANISM: Artificial Sequence 

148 <220> FEATURE: 

149 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note = 

150 synthetic construct 

152 <220> FEATURE: 

153 <221> NAME/KEY: VARIANT 

154 <222> LOCATION: X 6-7, 13, 17; 19^, 2,0-21, 2^ 26-27, 30', 34-36, 38, 41-43, 

155 -46, 50-53, 55-56, ^60 

156 <223> OTHER INFORMATION: Xaa = any amino acid 

158 <400> SEQUENCE: 6 

159 Val Arg Xaa Leu Glu Xaa Xaa Asn Lys Arg Leu Thr Xaa Glu Leu Asn 

160 1 5 10 15 

161 Xaa Leu Xaa Xaa Xaa Trp Gly Xaa Glu Xaa Xaa Arg lie Xaa Ala Leu 

162 20 25 30 

163 Tyr Xaa Xaa Xaa Met Xaa Gin Leu Xaa Xaa Xaa Leu Asp Xaa Ala Glu 

164 35 40 45 

165 Ala Xaa Xaa Xaa Xaa Leu Xaa Xaa Lys lie Asn Xaa 

166 50 55 60 

168 <210> SEQ ID NO: 7 

169 <211> LENGTH: 60 

170 <212> TYPE: PRT 

171 <213> ORGANISM: Artificial Sequence 

173 <220> FEATURE: 

174 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note = 

175 synthetic construct 

177 <400> SEQUENCE: 7 

178 Val Arg Ser Leu Glu Asp Glu Asn Lys Arg Leu Thr Asp Glu Leu Asn 

179 15 10 15 
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' RAW SEQUENCE LISTING DATE: 02/06/2006 

PATENT APPLICATION: US/10/532 , 198 TIME: 14:26:39 

Input Set : A:\ 211010049SEQLIST.TXT 
Output Set: N:\CRF4\02012006\J532198.raw 

180 Asp Leu Lys Asp Gin Trp Gly Asn Glu Thr Ala Arg lie Lys Ala Leu 

181 20 25 30 

182 Tyr Asp Ser Asp Met Ser Gin Leu Arg Arg Ser Leu Asp Gin Ala Glu 

183 35 40 45 

184 Ala Ser Lys Ala Gin Leu Glu Met Lys lie Asn Thr 

185 50 55 60 

187 <210> SEQ ID NO: 8 

188 <211> LENGTH: 60 

189 <212> TYPE: DNA 

190 <213> ORGANISM: Artificial Sequence 

192 <220> FEATURE: 

193 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note = 

194 synthetic construct 

196 <400> SEQUENCE: 8 

197 cttctgcaag gtccacagtt accattaaca aatcgagtcc tgcagtaaag gaatatgaaa 60 

199 <210> SEQ ID NO: 9 

200 <211> LENGTH: 60 
2 01 <212> TYPE: DNA 

202 <213> ORGANISM: Artificial Sequence 

204 <220> FEATURE: 

205 <223> OTHER INFORMATION: Description of Artificial Sequence:/ note = 

206 synthetic construct 

208 <400> SEQUENCE: 9 

209 tgagacaatc ctacaatttt tctggagcac ctatgggggg atcagttcaa attcatagca 60 
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RAW SEQUENCE LISTING ERROR SUMMARY 

PATENT APPLICATION: US/10/532 , 198 



DATE: 02/06/2006 
TIME: 14:26:40 



Input Set : A:\ 211010049SEQLIST.TXT 
Output Set: N:\CRF4\02012006\J532198.raw 



aase Note; 

5 of n and/or Xaa have been detected in the Sequence Listing, Please review the 
juence Listing to ensure that a corresponding explanation is presented in the <220> 
<223> fields of each sequence which presents at least one n or Xaa. 

2#:1; N Pos . 169,226,276 

3#:2; Xaa Pos. 2,48 r\ 

3#:3; Xaa Pos. 2 ,3,4 , 5 , 6 , 7 , 8 , 9 , 10 , 13 , 14 , 17 , 18 , 20 , 21 ,f23 , 24 , 25 , 27, 28 , 29 , 32 

3#:3; Xaa POS. .^35,36,37,38,42,45,47^,4-9,5^,54^5^^ - 

3#:5; Xaa Pos. ^24 v * 

3#:6; Xaa Pos. V^Jj , 1-7 , 19, 20, 21 , 24, 2-6„-27, 3p , 34', 35 , 36 , 3 f 8 , 41 , 42 , 43 , 4'6 

3#:6; Xaa Pos. 5^51,52,53,55,56,60 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/532 , 198 



DATE: 02/06/2006 
TIME: 14:26:40 



Input Set : A:\ 211010049SEQLIST.TXT 
Output Set: N:\CRF4\02012006\J532198.raw 



L3 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
17 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos.:120 
541 Repeated in SeqNo=l 

57 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2 after pos . : 0 
341 Repeated in SeqNo=2 

31 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3 after pos . : 0 
341 Repeated in SeqNo=3 

L34 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 after pos . : 0 
341 Repeated in SeqNo=5 

L59 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 after pos . : 0 
341 Repeated in SeqNo=6 
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